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Abstract 

The classical result in the theory of random graphs, proved by Erdos and Renyi in f 960, concerns 
the threshold for the appearance of the giant component in the random graph process. We consider 
a variant of this problem, with a Ramsey flavor. Now, each random edge that arrives in the sequence 
of rounds must be colored with one of r colors. The goal can be either to create a giant component 
in every color class, or alternatively, to avoid it in every color. One can analyze the offline or online 
setting for this problem. In this paper, we consider all these variants and provide nontrivial upper 
and lower bounds; in certain cases (like online avoidance) the obtained bounds are asymptotically 
tight. 



1 Introduction 

Let Gr n ,m be the Erdos-Renyi random graph with n labeled vertices and m randomly chosen edges. 
A celebrated result of Erdos and Renyi, probably the single most important result in the theory of 
random graphs, discovered a threshold for the appearance of the giant component in this random 
model. Erdos and Renyi proved that if m < (1 — e) '^ for a constant e > 0, then whpQ the random 
graph G nt m has all of its connected components of order at most logarithmic in n; on the other hand, 
if m > (1 + e)f then whp G n ^ m has a unique connected component of linear size, the so called giant 
component, while all other components are at most logarithmic in size. This result can be formulated 
equivalently in terms of the random graph process: if the process starts with the empty graph Go on 
n vertices, and at stage i > 1 a random missing edge is added to Gj_i to form Gj, then after the first 
(1 — e)f| rounds the resulting graph typically has all connected components of at most logarithmic 
size, while after (1 + e)5 rounds whp the unique giant component is born, while all other components 
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are of size O(logre). Since then, there have been numerous extensions to this fundamental result. One 
further ramification is considered in this paper. 

Recently, quite a lot of attention and research effort has been devoted to controlled random graph 
processes. In processes of this type, an input graph or a graph process is usually generated fully 
randomly, but then an algorithm has access to this random input and can manipulate it in some well 
defined way (say, by dropping some of the input edges, or by coloring them), aiming to achieve some 
preset goal. There is usually the so called online version where the algorithm must decide on its course 
of action based only on the history of the process so far and without assuming any familiarity with 
future random edges, and the offline version, where the algorithm has access to the whole history of the 
process and makes its decisions based on the full knowledge of the process. We will give corresponding 
accurate definitions for our setting later. 

Applied to the question about the appearance of the giant component, the first such version 
chronologically is probably the so-called Achlioptas process. This process is named after Dimitris 
Achlioptas, who posed the following question about 10 years ago. Suppose random edges arrive in 
pairs, and an online algorithm can choose one of them, put it into the graph, and return the other 
edge to the pool. Is it possible to design an algorithm that whp delays the appearance of the giant 
components for noticeably longer than the Erdos-Renyi 0.5n steps? This question was answered 
affirmatively in [4] by the first two authors of the present paper, who exhibited an algorithm that 
whp survives for at least 0.535n rounds without creating the giant component. Since then, there 
has been a series of papers about the giant component in Achlioptas processes, where a variety of 
scenarios and goals (online and offline algorithms, delaying or accelerating the appearance of the giant 
component) have been considered. 

Here we consider a Ramsey-type version of controlled random processes. In this version, incoming 
random edges are colored by an algorithm in one of r colors, for a fixed r > 2. The goal of the 
algorithm is to achieve or maintain a certain monotone graph property in all of the colors. This 
setting originates in the papers of Rodl and Rucihski [22||23], who determined when G n ^ m satisfies the 
Ramsey property of having a monochromatic copy of a fixed graph H in any r-coloring of the edges. 
In our terminology they considered the offline version of the problem, and the property V to avoid in 
each color was the appearance of a copy of a fixed graph H. The online version of the problem for 
the case of two colors and H = K% was treated by Friedgut, Kohayakawa, Rodl, Rucihski and Tetali 
in [10], and extended to a wider variety of graphs by Marciniszyn, Spohel and Steger in 117] . The 
online setting of achieving Hamiltonicity in each of r colors has been addressed in |13j . 

In the present paper, we investigate several Ramsey-type problems involving the giant component. 
We consider whether or not it is possible to color the edges of G n . m in r colors with the objective 
of creating a giant component in every color class, or of avoiding a giant component in every color. 
We study both the offline and online settings. In the offline setting, an algorithm gets access to the 
entire graph, generated according to the probability distribution G n>m ; in the online setting the edges 
of G n . m are first ordered in a random order and then revealed to the algorithm one by one (i.e., the 
algorithm observes the random graph process and colors each new edge as it arrives). 

The main objective of this paper is to show new interesting questions, and not necessarily to get 
precise answers to all of them. We do determine the offline thresholds for these problems for all values 
of r, but the online setting remains open. There, we show that for two colors, there is always a 
separation phenomenon away from the trivial bounds, and then calculate asymptotic bounds for large 
numbers of colors. 



2 



As a warm-up, consider the offline threshold for creating a giant in every color. Recall that if 
m < (1 — e)^ for any fixed e > 0, then whp G n)Tn itself has all components of size O(logre). On 
the other hand, one can show that for m ^> (1 -)- c)^, whp it is possible to color the edges of G n ^ m 
with any fixed number of colors r > 2, so that every color class contains a component of order O(n). 
Indeed, Ajtai, Komlos, and Szemeredi proved in [2] that whp G n ,(i+ e )^ contains a path of length c e n. 
(Here and later in the paper, we will write c t to specify a positive constant determined only by e.) By 
splitting this path into r paths of length c e n/r, the result follows. 

The question of avoiding giants in all colors offline is not so simple. It turns out that the threshold 
for avoiding giants in r colors is precisely the same as that of r-orientability, which says that it is 
possible to direct all of the edges of the graph so that the resulting digraph has maximum in-degree 
at most r. Cain, Sanders and Wormald [8j, and Fernholz and Ramachandran [9] recently discovered 
that this threshold coincides with the number of edges needed to make the (r + l)-core have average 
degree above 2r. More precisely, they showed that for any integer r > 2, there is an explicit threshold 
tp r such that the following holds. For any e > 0, if m > (tp r + e)n, then whp G n ^ m contains a subgraph 
with average degree at least 2r + c t , where c e > 0. On the other hand, if m < (ip r — e)n, then G Utrn is 
r-orientable whp. The asymptotic dependence of ip r on r is tjj r = r — ^ (~ + o(l)) r , as calculated in 
[8]. We now state our first main theorem in terms of this threshold. 

Theorem 1.1. Given any fixed r, let ip r be the threshold referenced above. For any e > 0, if m < 
(?/v — e)n, then whp it is possible to color the edges of G n ^ m with r colors such that each color class 
contains components of order only o(n). On the other hand, if m > (tp r + e)n, then whp every 
r- edge- coloring of G n , m has a color class with a component of order at least c e n. 

Remark. This was also recently and independently discovered by Spohel, Steger, and Thomas |25j . 

We also consider online versions of these problems, in which the m edges come sequentially, and 
each must be colored as soon as it appears. Precisely, we consider the process to be a sequence of m 
rounds. In each round, a random edge arrives, independently and uniformly distributed over all pairs 
of vertices. If it repeats an existing edge, then we do not force ourselves to recolor it. This is not an 
important issue, because we will never consider more than 0(n) rounds, but it is more convenient to 
use this product probability space with full independence between the rounds. 

Here, we have several results. First we state them for avoiding giants in all colors. The offline 
upper bound of course supplies an upper bound for the online case as well. Indeed, a standard coupling 
argument (Fact [231 in the next section) translates the offline upper bound to the case where the rounds 
have independent edges (possibly with repetitions). So, after (ip r + e)n rounds, whp every possible 
coloring of them contains a giant component, where the dependence of ^ r on r is ^ r = r ~\ (f + °(l)) r - 

On the other hand, by taking the natural online adaptation of the offline avoidance strategy, which 
was based on edge orientation, we found a randomized online algorithm which matches the first-order 
asymptotic of ip r = (1 — o(l))r. 

Theorem 1.2. For any e > 0, the following holds for all sufficiently large r. There is an online 
randomized algorithm which can last for (1 — e)rn rounds, while keeping all connected components in 
each of r color classes smaller than o{n) whp. 

For large r, this is asymptotically a factor of 2 better than the trivial bound of (1 — e)^ rounds, 
obtained by coloring each edge independently at random. On the other hand, for small r, beating the 
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trivial bound corresponds to using e = 0.4999 above (say), and this requires r > 50. For the extreme 
case of small r, we have the following result using an entirely different strategy, which improves upon 
the trivial bound for all r by a factor of approximately 1.06. 

Theorem 1.3. There is an online algorithm which can 2-color edges for 1.06n rounds, while keeping 
all connected components in both color classes of size at most O(logn) whp. 

Remark. Although the theorem is stated only for r = 2, it immediately gives a strategy for all 
even r, by splitting the colors into | pairs. At each round, one of the color pairs is randomly chosen, 
and the above algorithm is used to decide which of the two colors in the pair to use. Then, this will 
avoid giants in all colors for 1.06n • | rounds whp. For odd r, one can run the above modification for 
1.06ra • rounds using only the first r — 1 colors, and then an additional (1 — e)S rounds using only 
the r-th color. This beats the trivial bound of ^ by a factor which approaches 1.06 as r grows. 

When the objective is to create giants in every color class, the trivial bounds are as follows. 
Certainly, if fewer than (1 — e)S edges are observed, then whp there will be no giant in the uncolored 
graph, so one cannot hope to create r monochromatic giants any faster. Note that this trivial lower 
bound turned out to be the truth in the offline setting, even though it does not grow with r. We will 
show that in the online case, there is a lower bound which does. 

Theorem 1.4. There is a constant c ~ 0.043 such that after (clog 2 r)n edges are r-colored by any 
online algorithm, whp some color class still has all components of order only O(logn). For r = 2, the 
same result holds for c'n edges for any d < 2 — ^/2 ~ 0.586. 

On the other hand, the trivial strategy of randomly coloring each edge succeeds when the number 
of edges surpasses rn/2. We are able to give an online algorithm which asymptotically performs far 
better than the trivial one. 

Theorem 1.5. There is an online algorithm such that for any e > 0, after (c r +e)n edges every color 
class contains a connected component of order at least c e n whp, where the dependence of c r on r is 

c. = (i + (i)p£ 

For the specific case of 2 colors, one can adapt the argument and obtain a value of C2 = |, but we 
give a slightly more sophisticated strategy which creates giants even faster. 

Theorem 1.6. There is an online algorithm such that for any e > 0, after 0.733n rounds both color 
classes contain connected components of order at least c e n whp. 

This paper is organized as follows. The next section reviews some standard probabilistic facts, and 
then develops a general tool which extends a recent result of Spencer and Wormald from [24] . This 
allows us to control the evolution of the susceptibility of a graph under the addition of random edges. 
Section [3] completely resolves the offline case, by proving Theorem II .li For the online setting, Sections 
H] and [5] consider the respective problems of avoiding and creating giants. The final section contains 
some concluding remarks. 

Throughout our paper, we will omit floor and ceiling signs whenever they are not essential, to 
improve clarity of presentation. All logarithms are in base e ~ 2.718 unless otherwise specified. The 
following asymptotic notation will be utilized extensively. For two functions /(n) and g(n), we write 
f(n) <C g(n), f{n) = o(g(n)), or g(n) = w(/(n)) if lim^oo f(n)/g(n) = 0, and f(n) = 0(g{n)) or 
g(n) = f2(/(n)) if there exists a constant M such that |/(^)| < M\g(n)\ for all sufficiently large n. 
The number of vertices n is assumed to be sufficiently large where necessary. 
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2 Preliminaries 



In this section, we review some standard facts commonly used in Probabilistic Combinatorics. Then, 
we use them to prove a useful result (Theorem 12, 6p which shows that a certain graph parameter, the 
susceptibility, tracks a natural differential equation. This extends a result of Spencer and Wormald, 
and we state it in a general-purpose form for the convenience of possible future citations. 

2.1 Probabilistic tools 

We recall the Chernoff bound for exponential concentration of the binomial distribution. The following 
formulation appears in, e.g., pQ. 

Fact 2.1. For any e > 0, there exists c e > such that any binomial random variable X with mean // 
satisfies P [\X - /i\ > efi] < e~ c ^. 

A binomial random variable is the sum of independent indicator variables. We also need concen- 
tration in settings without complete independence. Recall that a martingale is a sequence Xq,Xi, . . . 
of random variables such that each conditional expectation E [^t+i | Xq, . . . , Xt] is precisely Xt. The 
Hoeffding-Azuma inequality (see, e.g., pQ) provides concentration for martingales with bounded step- 
wise increments |ATt+i — Xt\, and this has been widely used in probabilistic combinatorics. 

When only one-sided concentration is needed, it can be convenient to consider instead a super- 
martingale, which only requires E [X^+i | Xq, . . . ,Xt] < Xt for all t. We will use the analogue of 
Hoeffding-Azuma for supermartingales, which follows from exactly the same proof as for martingales 
(see, e.g., [15J or |26j). 

Fact 2.2. Let Xq, . . . ,X n be a supermartingale, with bounded differences — X%\ < C . Then for 

any A > 0, 



We can also define submartingales via the requirements E [Xt+\ \ Xq, . . . ,Xt] > Xt; estimates on 
their lower tails, similar to the above fact, follow by symmetry. 

Finally, we will frequently switch between the models G n<p , G Ujm , and the product space of m 
independent uniform random edges, depending on which one is the most convenient. Adding more 
edges makes it harder to avoid giants, but easier to create them, so all properties we consider are 
monotone. Hence the following fact allows us to translate results between the models, while still 
keeping everything sharp to first-order. 

Fact 2.3. Fix any constant e > 0, and suppose that m = m(n) tends to infinity with n, but m = o(n 2 ). 
Then there are couplings of the corresponding probability spaces such that the following hold. 

(i) G n/m C G UtP whp f rp={l + e)^f, and G n . m D G n , p whp for p = (1 - e)^f. 

(ii) The graph formed by generating m random edges (possibly with repetition) is always contained in 
G n ^ m , and whp contains G n m > with ml = (1 — e)m. 

Proof sketch. By the standard coupling of G n)7n and G UtP via the random graph process, part (i) 
follows from the Chernoff bound on Bin^ghp] • For part (ii), one can similarly couple G n ^ m i with the 
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product space of m edges by considering an infinite sequence of independent random edges. Then, 
the m-edge product space is the projection onto the first m choices, and G n>m > is the graph consisting 
of the first m! distinct edges. So, it suffices to show that whp, there are at least (1 — e)m distinct 
edges among the first m sampled with replacement. Observe that when the k-th edge is sampled, the 
probability that it is a repetition of a previously sampled edge is always less than k/ Q) < § since 
m = o(n 2 ). Therefore, the number of samples which are repetitions is stochastically dominated by 
Bin[m, |1 , which is at most em whp by the Chernoff bound. Then, the number of distinct edges is 
at least (1 — e)m, as desired. □ 

2.2 Evolution of susceptibility 

One of the most useful parameters for studying the giant component of a graph is the susceptibility. 
For a graph G, this is defined as S(G) = ^ Yl v ^v, where C v is the size of the connected component 
in G containing v. Note that this also equals — times the sum of the squares of the component sizes. 
Many researchers have investigated the evolution of the susceptibility under random edge addition, 
starting with Bohman and Kravitz, who used this to analyze the Achlioptas process in [6]. 

More recently, Spencer and Wormald proved in [24J that for m up to (1 -e)f , the susceptibility 
of the m-edge random graph evolves like the solution (f>(m) of the differential equation (/>' = ^(j) 2 
with initial condition <^>(0) = 1. The heuristic for this differential equation is quite natural, although 
the formal proof is nontrivial. Indeed, when a random edge is added to some intermediate (and 
subcritical) G, its endpoints typically lie in different components, each of which has expected size 
S(G). If both component sizes are close to S(G), then the increment to S(G) after adding the edge 
is roughly ±[(S(G) + S(G)) 2 - 2S(G) 2 ] = ^S(G) 2 . Thus, one might expect the evolution of S{G) to 
follow (f)' = §0 2 . The solution of this differential equation is cfi(m) = (l — §m) , so it only blows up 
when m reaches ^. This matches the classical threshold of the giant component, because the result of 
Spencer and Wormald concentrates S(G ntTn ) around 4>(m) for m up to (1 — e)^. In this range, S(G njm ) 
is then bounded by a constant, and we can always trivially bound the size of the largest component 
by y / nS(G), so the largest component is o(n) whp. 

However, once we start to color edges, the color classes are no longer Erdos-Renyi random graphs. 
It is then crucial to control the evolution of susceptibility from initial graphs which are non-empty. 
One of the main contributions of [21] was a result of this nature, but it only controlled one phase of 
evolution. In order to formulate it, we need the following definition. 

Definition 2.4. A graph has a K,c component tail if for all positive integers s, at most Ke~ cs - 
fraction of its vertices lie in components of order at least s . 

Note that a K, c component tail immediately implies that all components have order 0(log n). Now 
we restate a key result of Spencer and Wormald (Theorem 3.1 of |24j ) . translated into an equivalent 
form via Fact 12.31 

Fact 2.5. Let L,K,c,~f be positive real numbers. Let G be a graph on n vertices with a K,c component 
tail and S(G) < L. Add (1 — 7)5^ independent random edges to G, ignoring repeated edges, and let 
the result be G . Then there exist K' \d such that G has a K',c' component tail whp. 

The K' , d component tail is very useful, because it bounds the entire distribution of the component 
sizes. However, our arguments also need control of the new value of the susceptibility after random 
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edge addition, so we prove the following extension of the above result. This can be done using the 
methods used in [24] , but we include here an alternate (and simpler) proof, following ideas from [3]. 

Theorem 2.6. Let L, K, c, 7 be positive real numbers. Let G be a graph on n vertices with a K,c 
component tail and S(G) < L. Add (1 — 7)5^ independent random edges to G, ignoring repeated 
edges, and let the result be G' . Then there exist K',c' such that whp G' has a K',c' component tail, 
and S(G') + o(l). 

Remark. The bound — arises from the following heuristic. Suppose that the initial susceptibility 
is L. We will show that its evolution is dictated by the differential equation <p' = —cfp' with initial 
condition (p(0) = L, whose solution is <p{t) = (jj — jjt) ■ Substituting t = (1 — 7)5^ gives ^. 

Proof. Note that by definition, the susceptibility is always at least 1, so we will implicitly use L > 1 
throughout the proof. Let T = (1 — rfjk- Let e%, . . . , &r denote the incoming edges, and let Gt be 
the graph after the addition of the first t of them. Fact 12.51 gives constants K' , d such that Gt has a 
K' ',d component tail whp. 

Let <j)(t) = (jj — ^t) . We now formalize our heuristic argument which suggests that S(Gt) evolves 

5L t 1 

like 4>(t). For each t, let Et be the event that Gt has a K' , d component tail and S(Gt) < 4>(t)+e 1 n n~ 3 . 
Note that we will only run t up to T < n, so the exponential factor is only at most a constant, and 
hence the error term tends to zero as n grows. Now, consider the sequence of random variables: 

f SL t 1 

\S(G t ) - (j)(t)-e~"n-3 if holds, 
Xt — < 

[Xt_i otherwise. 

We claim that X t is a super martingale. Indeed, suppose that Gt has components of order C\, C2, ■ ■ ■ 
If the incoming edge ^1^2 has v\ in the i-th component and 1)2 in the j-th component, then the 
susceptibility increases by exactly \\{Ci + Cj) 2 — Cf — C?] = ^CiCj when i ^ j, and zero otherwise. 
Therefore, 

E[S(G t+ i) I ei,...,e t ] = 

< 



We use this to bound the expected conditional increment in X t . Note that for the purposes of bounding 
E [-X^+i I ei, . . . ,et] we may assume that Et holds (otherwise this conditional expectation is trivially 



S(G t ) + Y,-^C r --^ 
^-^ n n n — 1 



S{G t ) + 



n-1 U Y J 



S(G t ) + -S{G t ) 2 . 

n — 1 
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equal to Xt). Using the above, and the convexity of 4> and the exponential, we have: 



E[X; 
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We will only run t up to T = (1 — 7)^, so we always have ~ < 1, as well as <p(t) < — because 4> is 
increasing. Plugging in these bounds, the (f>(t) and exponential factors are replaced by constants, so 
the asymptotic behavior of each term is determined by the power of n in the denominator. Hence the 
second and fourth terms dominate, giving 



ELY, 



X t I ei,...,e t ,£ t ] < (l + o(l)) 
= "(l + o(l)) 



n 4/3 7 
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L 5L 
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5L 1 
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e t 
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which is negative for sufficiently large n. Therefore, Xt is indeed a supermartingale. Observe that 
X = -n" 1 / 3 . We will use the Hoeffding-Azuma inequality (Fact 12 .2[) to prove that whp, X t < for 
every t <T. For this, note that the one-step change in Xt is zero if Gt does not have &K' ,c' component 
tail. Otherwise, as previously remarked, all components of Gt are bounded by some Clogn, so the 
maximum change in the susceptibility is ^-(Clogn) 2 . To bound the one-step change in the error term 

BL t _ 1 

<p(t) + e t 71 n 3 , which is an increasing convex function, it suffices to use the first derivative at t = T. 
Recalling that T = (1 — 7)^, this turns out to be precisely 



d 
dt 



t=T 



- —T 

L n 



+ 



e 7 n n 



7n 



7\-2 2 slt 5L 
— - + e 7 n — . 



which is clearly 0(n 1 ) because 7 and L are constants, and T < n. Applying the Hoeffding-Azuma 
inequality with A = n -1 / 3 , we find that for each t < T < n, 



[X t > 0] < exp 



n 



-2/3 



2.(|(Clogn)2) 2 ^ 



< exp 



n 



1/3 



8C 4 log 4 n 



A union bound over all t < T shows that whp, all Xt < 0. Furthermore, Fact 12.51 implies that whp, 
Gt has a K' ,c' component tail. 

To complete our argument, we claim that whenever all of these high-probability events happen, 
then all St occur for < t < T. We prove this by induction on t. Each Et has two parts: a 
component tail and an upper bound on S(Gt). We will never need to worry about checking the 
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K' , d component tail property because Gt C Gt for all these t, and we already conditioned on Gt 

having that (monotone) property. We concentrate on the upper bounds for S(Gt) in the remainder 

of this proof. For the base case t = 0, the susceptibility part of So is immediate by definition since 

— — _i 

S(Gq) = (f)(0) < 0(0) + e t n n~3. For our induction step, given that &t-\ occurs, the definition of X t 

5L t i 

is then S(Gt) — 4>{t) — e i ™n~z instead of the alternative X t _i. Yet we assumed that X t < 0, so that 
gives the susceptibility part of St, and completes the induction. 

Therefore, we conclude that St occurs whp, which in particular means that S(Gt) < <t>(T) + 

5LT _ i - 

e 7 n n 3= ^_)_ o(l), as desired. □ 



3 Offline avoidance of giants 

In this section, we prove Theorem which has two parts, a lower and an upper bound. The lower 
bound relies on the following relationship between orientability and decomposition. Recall that we 
call a graph r-orientable if it is possible to orient all edges such that all in-degrees are at most r. 

Lemma 3.1. The edges of any r-orientable graph G can be colored with r colors such that for every 
pair of distinct vertices u, v, there are at most 2 monochromatic paths in each color connecting u and 
v. 

Proof. Fix an orientation of G with all in-degrees at most r, and greedily color the edges by r colors 
so that at each vertex, all incoming edges are differently colored. Consider a particular color class. By 
construction, it is a directed graph with all in-degrees at most 1, so it is a disjoint union of unicyclic 
components. Then, every pair of vertices is linked by at most two paths in that color, as desired. □ 

The previous lemma produces a coloring whose connectivity is very fragile. Our next lemma 
quantifies this, showing that the (a priori, possibly large) monochromatic components shatter easily. 

Lemma 3.2. For any e > 0, there is c > such that the following holds. Let G be a graph on n vertices 
with maximum degree logn, where every pair of distinct vertices is connected by at most 2 distinct 
paths. Independently delete each edge of G with probability e. Then, whp all connected components of 
the resulting graph have order at most ne logiogn = (n). 

Proof. Define c such that 1 — e = e _8c , and let the susceptibility be - Y^, v C v , where C v is the size of 
the connected component containing v. Let the random variable S be the susceptibility of the graph G' 
which remains after the edge deletions. Since Yl v equals the sum of the squares of the component 
sizes, all components of G have order at most ynS. Thus, it suffices to show that S < ne io g io g n 
whp. 

Fix an arbitrary vertex v. Since G has maximum degree logn, the total number of vertices within 
distance D = | i Q ^g n of v is at most (logn) 13 = ^fn. Any other vertex u has probability at most 
2(1 — e) D = 2e~ 8cD of being connected to v after the deletion. This is because there are at most 2 
paths between u and v, and each path has length at least D. Therefore, by linearity of expectation, the 

log n 2^ log n 

expected size of the component containing v is E [C v ] < y/n + n • 2e~ logiogn < ne ~ c io g io g n _ Another 

3 e log n 

application of linearity of expectation gives E [S] < ne iogiogn . So, by Markov's inequality, S exceeds 

2 e log " c log n 

ne logiogn w ith probability at most e logiogn — o(l), completing the proof. □ 
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Remark. The self-contained argument above only requires a relatively weak maximum degree con- 
dition, and is sufficient for our purposes. It is worth mentioning that under the stronger assumption 
that G is a random graph, one can use the substantially less trivial Lemma 11 of [5] to sharpen the 
eventual bound to n 1-Ce , as Spohel, Steger and Thomas do in [25]. Indeed, that lemma claims that if 
m = cn, then there is a large constant K such that in G n m , the number of vertices within distance 

log2K 

(log n)/K of any vertex v is at most n k whp. Using this fact above instead of our exploration to 
depth ^ logiogn bounds all connected components below n 1_CE . 

Let us now state the result of Cain, Sanders and Wormald [8], and Fernholz and Ramachandran 
[9], on the matching thresholds for orientability and average degree. 

Fact 3.3. For any integer r > 2, there is an explicit threshold ip r such that the following holds. For 
any e > 0, if m < (ip r — e)n, then G n ^ m is r-orientable whp. On the other hand, if m > (ip r + e)n, 
then whp G n ^ m contains a subgraph with average degree at least 2r + c t , where c e > 0. 

We are now ready to prove the lower bound, which we first translate to G UjP for convenience. By 
applying Fact 12.31 and rescaling e, it suffices to show that if p = 2(1 — e)(ip r — e)/n, then whp there is 
a coloring of G np where every color class has all components of order o(n). 

Proof of lower bound of Theorem 11.11 Let p' = 2{ip r — e)/n, and observe that G n ^ p can be 
obtained from G' = G U:P i by independently deleting each edge with probability e. First, consider the 
graph G' before deletions. By Fact 13.31 G' is r-orientable whp. Also, it is easy to see that since np is 
at most the constant 2ip r , G' has maximum degree at most logn whp. Indeed, each individual degree 
is distributed as Bin(n — l,p), and P [Bin(n — l,p) > logn] < (i g n )p logn < (jipr) ° gn . Since np is 
bounded by a constant, this is o(re _1 ), so a union bound over all n vertices implies that the maximum 
degree is at most logn whp. 

Thus, by Lemma[3Tj we can color the edges of G' so that every pair of distinct vertices is connected 
by at most two paths in each color. This, together with our degree bound and Lemma [3.21 shows that 
after deleting each edge of G' independently with probability e (to obtain G UjP ), whp all color classes 
have connected components of order only o(n). □ 

For the upper bound, we use the second half of Fact 13.31 which gives a subgraph of high average 
degree. It turns out that this is already enough to ensure a giant. To see this, we first show that small 
sets of vertices typically induce low average degree in the random graph. 

Lemma 3.4. For any A, e > 0, there is a constant c > such that in G HiP with p = ~, whp every set 
of at most cn vertices induces a subgraph with average degree less than 2 + e. 

Proof. Without loss of generality, assume that e < 1 and A > 1. Let c = (e 3 A 2 )~~. We will take a 
union bound over all subsets of t < cn vertices. For a fixed value of t, the probability that some i-set 
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of vertices induces at least (l + |)i edges is at most 
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To complete our union bound, we sum the final expression over the range 1 < t < cn. We split this into 
two intervals, separating at t = logn. Observe that the quantity in the square brackets increases in t, 



and reaches tt when t 



cn. 



So, the sum over the interval logn < t < cn is at most Yliogn ^ * = °(-0 



For the other interval t < logn, the square bracket is still at most \ < 1, so we can ignore the outer 

exponentiation and conclude that the final expression is at most • (^p) 2 . Multiplying this by 
the number of values of t in this interval (logn), we see that the final sum is still o(l). Therefore, the 
property holds whp, as claimed. □ 

From this, we immediately derive the following useful corollary, which ensures a giant in any 
subgraph of average degree at least 2 + e. 

Corollary 3.5. For any A, e > 0, there is a constant c > such that in G n ^ m with m = \n, whp 
every subgraph with average degree at least 2 + e contains a connected component of order at least cn. 

Proof. By the previous lemma and Fact 12.31 whp G n , m has the property that every set of at most 
cn vertices induces a subgraph with average degree less than 2 + e. Then, consider any subgraph H 
with average degree at least 2 + e. Separating H into its connected components, we find that some 
component must have average degree at least 2 + e. Therefore, that component must have order at 
least cn, as desired. □ 



Proof of upper bound of Theorem 11.11 By Fact 13.31 if ni > (ip r + e)n, whp G n m contains 
a subgraph H with average degree at least 2r + c t . No matter which colors appear on the edges of 
H, some color class will have average degree at least 2 + c e /r, and therefore contain a giant whp by 
Corollary E21 □ 



4 Online avoidance of giants 

In this section, we consider the online case of the avoidance problem. We first show that a natural 
adaptation of the offline algorithm gives an asymptotically sharp result for large numbers of colors. 
Then, we consider the other extreme with 2 colors, and show that the trivial bound is not tight there. 
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4.1 Many colors 

Our offline algorithm avoided giant components by orienting edges to minimize in-degrees. By replacing 
the offline orientation procedure with an online one, this strategy naturally extends to the online 
setting. Online edge orientation has been extensively studied, in the famous equivalent formulation 
known as the "power of two random choices" with balls and bins (see [18J for a survey of results). 
Indeed, that setting had n bins, with kn balls coming sequentially, each with two independent random 
choices for a destination bin. The objective was to control the maximum load across all of the bins. 
This can be interpreted as a graph orientation problem, where each pair of bin choices corresponds 
to an incoming edge with the two choices as endpoints. The edge's orientation records which bin the 
ball is sent to, and the goal of controlling the maximum in-degree is precisely the same as that of 
controlling the maximum load in the balls-and-bins problem. 

It is now well-known that when the objective is to minimize the maximum in-degree, the stochasti- 
cally optimal online orientation strategy is to always orient each incoming edge towards the endpoint 
which currently has lower in-degree. However, it turns out that for the purpose of proving Theorem 
11.21 one can use a random orientation strategy, which is easier to analyze. Our coloring algorithm, 
which we call orient, internally maintains a set of orientations for all edges it has seen. To color a 
new edge e, it randomly orients it with equal probability toward one of its endpoints. Let the new 
in-degree of that endpoint be d. If d < r, then color d is used for the edge e. Otherwise, color r is 
used. Observe that just as in Lemma 13.1^ each of the first r — 1 color classes is a disjoint union of 
unicyclic components. Therefore, each of these color classes has every pair of vertices connected by at 
most two paths, so it will shatter by the same argument as in the proof of Theorem 1 1.11 

The new challenge in this section is to control the r-th color class. Fortunately, it turns out that 
it is extremely sparse. To prove this, it is more convenient to work in the random directed graph 
Gn tP , in which each of the n(n — 1) possible directed edges appears independently with probability 
p/2. Note that in this model, it is possible for both uv and uv to be present simultaneously. Our 
first claim is that G n ^ p typically has no long cycles containing many vertices of high in-degree. This 
is relevant because every edge in color r has an endpoint with in-degree at least r. 

Lemma 4.1. For any e > 0, the following holds for every sufficiently large constant r. Let G = G n ^ p 
be a random directed graph with p = (1 — e)^-, and G be the undirected graph on the same vertex set 
obtained by collapsing all edges between each vertex pair into a single undirected edge. Then, whp G 
does not contain any cycles of length at least \/log n for which at least half of the vertices on the cycle 
had in-degree at least r in G . 

Proof. We will use a union bound to show that a large family of objects do not appear in the random 
directed graph. Let us define an isomorphism type to be a cycle, say with vertices v\, . . . ,vt, whose 
edges have been oriented arbitrarily, along with a subset of at least t/2 of its vertices which have 
been designated as "high-in-degree vertices." Note that we do not require the edges of the cycle to 
be oriented in a consistent direction. The number of distinct i-vertex isomorphism types is at most 
2* • 2*, because each of the t edges can be oriented in 2 ways, and the number of different subsets of 
vertices that can be designated as high-in-degree is at most 2*. 

We say that G niP contains a copy of this isomorphism type if there is an embedding of the vertices 
Vi such that all consecutive edges i>ii>j+i are present in the correct direction, and all designated high- 
in-degree vertices V{ already have in-degree at least r — 2 from vertices other than Vi—x,Vi+i. We do 
not restrict our attention to induced copies, so other edges may also be present. If we can show that 
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over all isomorphism types with t > \/Togn, the expected total number of copies in G n ^ v is o(l), then 
we will be done by Markov's inequality. 

So, let us focus on a particular isomorphism type with t vertices. There are at most n t ways to 
embed the t vertices of the cycle. Each edge 1 independently appears with its correct orientation 
with probability exactly p/2. Next, consider a designated high-in-degree vertex Uj. Crucially, we only 
require in-degree at least r — 2 from vertices other than and fj+i. The reason for this exclusion 
is that the previous step may already have exposed the edges Vi-\v\ and Vi + iv\. But now, since our 
model even allows edges in both directions between vertex pairs, the probability that each designated 
vertex indeed has high in-degree is independently P [Bin[n — 3, |] > r — 2] . Since p = (1 — e)^ and r 
is large, each of these individual probabilities is bounded by the probability that Bin[n — 3, (1 — e)^] 
exceeds its mean by at least an ^-fraction. By the Chernoff bound, this happens with probability at 
most e~ c " r for some constant c e . By choosing large enough r, we may assume that this is below g|^r- 
Putting everything together, we find that the expected number of copies of the isomorphism type in 
G njP is at most 

n (2) ((MrV ~ (s 

We initially showed that the number of distinct t- vertex isomorphism types is at most 4*, so the 
expected total number of copies of all t- vertex isomorphism types is at most 2 — *. This is a geometric 
series, so its sum over all t > ydogn is still o(l), as desired. □ 



Remark 1. Since we had a convergent geometric series at the end of the proof, the v^log n bound is 
not tight. In fact, any function which grows with n is sufficient. 

Remark 2. If one is interested in beating the trivial bound, which corresponds to p ~ ^, one can 
choose e to be extremely close to, but just below, |. One can numerically check that if e = 0.4999 and 
r > 51, then the probability that Bin[n, (1 — e)^] exceeds r — 2 is at most 1& \ r -i for large n, because 
the Binomial converges to a Poisson variable with mean 0.5001r. Continuing the argument, this will 
show that the expected number of appearances of all t- vertex isomorphism types is at most (-j==j , 
which is still a convergent geometric series, so the same result will follow. 

Next, we establish an easy bound which holds for ordinary random graphs. 

Lemma 4.2. For every constant c, whp in G ntP with p = — , every set of t < vdogn vertices induces 
at most t edges. 



Proof. The expected number of sets with t < y4ogn and at least t + 1 edges can be bounded by 
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We now combine the previous two lemmas to show that the r-th color class shatters easily. In the 
proof of Lemma 13. 2\ the control of connectivity was done by bounding the number of distinct paths 
between every pair of vertices. This time, we use the notion of an essential edge. We say that an edge 
e on a path is essential if every other path connecting the same endpoints also contains e. It turns 
out that in the r-th color class, every long path contains a huge number of essential edges. 
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Lemma 4.3. For any e > 0, the following holds whp for every sufficiently large constant r. Let G be 
the graph formed by the r-th color class after (1 — e)rn independent random edges have been colored 
by orient. Then every path in G of length at least ydogn has the property that more than half of its 
edges are essential. 

Proof. Since each (random) incoming edge is randomly directed by orient, one can think of the 
input as a sequence of random directed edges, which is then deterministically colored using the rule in 
orient. By a similar argument to Fact 12.31 h suffices to consider the more convenient model where 
the input sequence is a random permutation of the edges of a random directed graph G = G np with 

V U oif- 

Note that if an edge of G is oriented toward a vertex with in-degree less than r, then regardless of 
the permutation, it will never be colored r. So, let H C G be obtained by deleting all edges oriented 
into vertices of in-degree less than r. Then H entirely contains the r-th color class. Let A be the set of 
vertices whose in-degrees were less than r, and let B be those that had in-degree at least r. Observe 
that we deleted all edges oriented toward vertices in A, so A spans no edges in H. In particular, any 
cycle in H has at least half of its vertices in B, i.e., with in-degree at least r. 

Therefore, by Lemma 14.11 whp all cycles in H have length at most ^log n. Also, condition on 
the result of Lemma 14.21 which shows that in G (and hence also H ) , every set of t < \/logn vertices 
induces at most t edges. These two graph properties will be enough to show that long paths in H 
contain many essential edges. 

Let P = vi, . . . ,vt be a path in H with length at least ydog n. Suppose for contradiction that 
at least half of its edges are non-essential. We claim that since ^logn <C v^log n, there must be 
non-essential edges tw+i and VjVj + \ such that i < j and S-^log n < j — i < 7^/Iogn. Indeed, if this 
were false, then out of the 7-^log n edges immediately following each non-essential edge in P, at least 
|-fraction of them would be essential. Then an averaging argument would contradict the fact that at 
least half of the edges were non-essential. 

Now, since UjUi+i is non-essential, there is another path P' = w\, . . . ,w s with w\ = v% and w s = v% 
which avoids the edge fjfi+i. Let a be the largest index such that w a £ {v\, . . . , Uj}, and let b be the 
next index after a such that Wf, E P. These exist because P and P' both contain v\ and vt- Note that 
by definition, Wb is actually in {fj+i, . . . , Vt\, and the segment of P' from w a to Wb intersects P only 
at w a and Wb- So, there is a cycle C\ formed by going from w a to u>b along P' , and then back to w a 
along P. Importantly, the common edges between C\ and P are a contiguous interval containing the 
edge ViV i+1 . 

Similarly, we can find a cycle C% containing the edge VjVj+i. Crucially, C\ and C2 are distinct 
(although not necessarily disjoint) because j — i > 3 v^log n and we conditioned on all cycles being 
shorter than y/\ogn. Yet j — i < 7-^logn, so the union of C\, C2, and the path fjfi+i ■ ■ - Vj forms a 
subgraph of order k < 9v / logn which spans at least k + 1 edges. Since we also conditioned on all such 
subgraphs having order at least -^Togn, this is a contradiction. Therefore, the path P must have had 
at least half of its edges essential. □ 

The previous lemma shows that the r-th color class is typically quite fragile as well. We now 
combine this with an adaptation of our offline argument, and prove that it is possible to avoid giants 
in all colors for nearly rn rounds whp. 

Proof of Theorem 11.21 By rescaling e, it suffices to give a randomized coloring algorithm that 
avoids giants in all colors whp, for a sequence of m = (1 — e) 3 rn independent random edges (possibly 
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with repetitions). As in our proof of Theorem 11.11 it is convenient to color a slightly denser random 
graph, because the deletion of fictitious edges shatters all large components. 

Strictly speaking, we cannot simply apply orient to a larger sequence of edges, because for this 
problem the input is a sequence of m edges, which must be processed online. We will therefore take 
some care in specifying how we randomly interleave the input into a longer sequence of edges, so 
that all operations are clearly online. Let us denote the final sequence of real and fictitious edges by 
ei, . . . , e m >, where m' = (1 — e) 2 rn. Initially, we select a random subset of m of the m! indices to 
correspond to the positions of the real edges. We then generate independent random edges for all 
other ei, and pass the resulting sequence to orient. Note that since the input distribution is uniform 
over all sequences of m edges, the augmented sequence of edges consists of m! independent random 
edges. 

Let a denote the colored sequence of m! edges produced by orient. The graph formed by a has 
maximum degree at most logn whp by the same argument as in the offline case. We also know by 
construction that there are at most 2 paths between every pair of vertices in each of the first r — 1 
color classes. For the r-th color class, Lemma [4 . 3 1 ensures that whp, all paths longer than ^logn have 
at least half of their edges essential. Let V denote the collection of all of these properties. We will 
write a £ V when all of them hold. 

Now, we delete the em' fictitious edges to recover the coloring of the original edges. Note that since 
the algorithm knows which m edges are real (that was the input), the edges to delete are completely 
determined. But crucially, it used an independent source of randomness to interleave the original m 
edges into the full sequence of m! edges. Therefore, if we only condition on a (and not on the input), 
then the distribution of which m edges were original is uniform over all possible subsets of m positions. 
Formally, we are calculating the probability of success by summing over all colored sequences a of m! 
edges. We have 

P [success] = P [success | a] P [a] > ^ P [success | a] P [a] 

Since we showed that a G V whp, it suffices to show that P [success | a] > 1 — o(l) for all a 6 V . 
We noted above that conditioned on a, the em' edges to delete were uniformly distributed over all 
subsets. Therefore, it remains to show that given any coloring with property V, the deletion of a 
random e-fraction of its edges whp shatters all large connected components. We accomplish this by 
deleting every edge independently with probability |, which will imply the result by a similar coupling 
argument to Fact 12.31 since Bin(W, |) < em' whp. 

For each of the first r — 1 color classes, Lemma 13.21 shows that all components shatter to o(n) 
whp, as in the offline proof. For the r-th color class, we now adapt the proof of Lemma 13.21 to use 
essential edges. Indeed, let us bound the expected size of the component C v containing a particular 
vertex v after the deletions. Since the maximum degree in G n ^ rn is logn, the total number of vertices 
within distance D = \ ^°\^ n of v is at most (logn) D = ^fn. Any other vertex u is at distance at least 
D 3> \/\ogn away from v, so a shortest path from v to u contains at least D/2 essential edges. The 
deletion of any essential edge disconnects u, v, so if edges are deleted with probability |, then u and v 

remain connected only with probability at most (1 — §) £>/ ' 2 = e Cl °g!°g« for some constant c. Hence the 

log n 

expected size of C v is at most ^/n + ne ioiT°g" = o(n), and by linearity of expectation, the expected 
susceptibility E [S] of the graph after deletions is o(n). Since the size of the largest component is at 
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most y/nS, Markov's inequality implies that the r-th color class also has all components smaller than 
o(n), completing the proof. □ 



4.2 Two colors 

The trivial algorithm, which randomly colors each edge blue or red, clearly lasts for (1 — e)n rounds 
whp. We now present a better algorithm which lasts for 1.06n rounds whp. To color a new edge 
e, it considers the set of colors C that appear on isolated edges which are incident with any of its 
endpoints. (If e is not incident to any isolated edges, then C is empty.) When C contains exactly one 
color, the algorithm colors the edge e with the other color. Otherwise, it randomly colors e either blue 
or red with equal probability. 

We analyze this by tracking a certain partition of the vertex set. Split the set of isolated edges into 
two groups based on their color, and call them the red matching and the blue matching, respectively. 
After the A;-th round, let: 

I k = number of isolated vertices, 
Bk = number of vertices in the blue matching, 
Rk = number of vertices in the red matching, 

and let J k = n — I k — B k — R k be the number of remaining vertices. These parameters correspond 
to the decomposition of the graph into its isolated vertices, the blue matching, the red matching, and 
the remainder. 

Lemma 4.4. With probability 1 — o(l), the following hold for all t < 1.1: 
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Proof. The probability that a particular vertex is not incident to any of the first tn edges is exactly 
(^"^'n^f = (l — ~)* n > wri ich tends to e~ 2t from below as n grows. Routine calculus easily bounds the 
convergence rate by 0(n _1 ), so E [^^n] = e~ 2t + 0(n~ 1 ). Now consider the edge-exposure martingale 
where is the conditional expectation of It n given the first k rounds. Changing the outcome of any 
particular round can only affect It n by at most 2, and there are tn rounds to determine If n , so by the 
Hoeffding-Azuma inequality (see Theorem 7.4.1 of [lj) Im is within (say) ^n 2 / 3 of its expectation with 
probability e - n( - nl/3 \ This gives the desired asymptotic for I tn . 

We estimate Bt n next. We claim that conditioned on the first k incoming edges e±, . . . ,ek, the 
expected change B^+i — B^ is 

E[B k+1 -B k \ ei ,...,e k ] = 2-( I -±) 2 \-^ + 0{n- 1 ). (1) 

The first summand comes from the creation of a blue isolated edge from 2 isolated vertices, which 
contributes 2 to B k . The probability that both endpoints are isolated vertices is ^ • ^Er- Since 
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n (n-i) — = 0(n~ 3 ) and I k < n, this is (^*) — 0(n _1 ). The ^ factor comes from the fact that the 
edge is randomly colored blue or red. 

For the second summand, the only way we lose blue isolated edges is when an endpoint of the 
incoming edge is incident to a blue isolated edge. The probability that the two endpoints hit two 
different blue isolated edges (hence contributing —4) is • B *Z\ ■ On the other hand, the probability 
that they hit exactly one isolated edge (hence contributing —2) is 2 • (l — B *Z\ ) • Thus the expected 
contribution from these losses is 

( _ 4 ).^.^Z^ + ( -2).2-^fl-^=i') = -l^ + Otn- 1 ), 



n n — 1 n \ n — 1 / n 

matching the second summand. 

Since we showed that -Ifn = (1 — o(l))e -2 * whp, equation dU) suggests that b(t) = \B tn should 
satisfy the differential equation 

| = (e- 2 *) 2 - 46, 6(0) = 0, 

whose solution is 6(f) = te~ it . 

We now verify this formally, using the same method as for the proof of Theorem 12.61 For each k, 
let £ k be the event that \\lk ~ e ~~ \ — an d \\B k ~ ^(„) | — e~n~3 . Now, consider the sequence 
of random variables 

{B k — nb(-) — e~n,3 if £ k _i occurs, 
Wk-i otherwise. 
We claim that W k is a supermartingale. Assume that £ k occurs. Then, using (pQ) we obtain 



E[W fc+ i -W k | e 1 ,...,e k ,£ k ] 
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Since ^ < e « + n 3 and e « < 1, we have (^-) 2 < e « + 2n 3 + 0(n 3). Similarly, — < 
— 46(-) + 4e~n~3. Recall that for any twice-differentiable function /, Taylor's formula ensures that 
for any t, h, there is some < £ < 1 such that f(t + h) - f(t) = f'(t)h + \ f"{t + £,h)h 2 . Since the 
second derivative of our function b(t) is bounded on the interval < t < 1.1, Taylor's formula gives 

K^TT") — b(n) = n^'(n) + 0(re~ 2 ). By a similar argument, e 7( » ' — = ^e"^+0(n~ 2 ). Combining 
all of these estimates and using b' = e -4 * — 46, we obtain 

E[W fc+ i -W k I ei,...,e k ,£k] 

_i / _2. 7fc 1 , f k\ 7k _1 

< e « + 2n 3 + 0(n 3)-46l-j+4e™n 3 -61 — I - 7e » n 3 
2-3eV n~3 + 0(n~s). 



< 0, 



so W k is indeed a supermartingale. Next, we bound the stepwise differences W k+ \ — W k . The change 
in B k is at most 4, and our Taylor estimates show that the error term nb{^) — e~n3 changes by at 
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most an absolute constant because b'(— ) is bounded on k < l.ln. Therefore, the Hoeffding-Azuma 
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Since ^ > e « -n 3 and e « < 1, we have ^ >e « — 2n 3 . Also, — ^ > — 46{ — ) — 4e « 
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Using the same estimates as before for frf^rO — Kn) an( ^ e 
E 



7(fc+l) 



e » , we obtain 



W k+ i - W k | ei, . . -,e k ,£ k 

4k 



> e 



2n 3-46 



n 



7fe 1 , i, , 

4e~n 3 + 0(n i ) - 6 



7fc _ 1 

+ 7e ™ n 3 



7fe \ 1 i . 

2 + 3e~ )n~3 + 0(n- 1 ). 



> 0, 

so W k is a submartingale. Applying the Hoeffding-Azuma inequality once again we see that 



3k < l.ln : W h < 



< e 



(3) 



We have now shown that whp, < 0, > 0, and \^I k — e~~\ < n -1 / 3 for every k < l.ln. 
Whenever these all happen, the same induction argument as in the conclusion of the proof of Theorem 
12.61 shows that every £ k necessarily holds as well. In particular, 



Bl- 



ob ( — 

n 



Tk 2 o 2 

< e»n3 < ens, 



for all k < l.ln. This completes the proof for Bt n , and the result for Rt n follows by symmetry. □ 

Now that we have control of the vertex partition, we study the evolution of the susceptibility. We 
have symmetry between blue and red, so it suffices to show that the susceptibility of the blue color 
class does not blow up before 1.06n rounds. Let X k be the sum of the squares of the component sizes 
in the blue color class after the i-th. round. Note that this is precisely n times the susceptibility of the 
blue color class. We claim that h^tn tracks x(t), which is the solution of the differential equation 



dx 
~dt 



x 2 + 3b 2 



2bx, 



x(0) 



(4) 



where b(t) = te 4 *. Numerical methods confirm that this differential equation blows up only at 
t « 1.065, and x(t) < 209 for all t < 1.06. 
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Lemma 4.5. Suppose that \X k < 210. Then the expected change in X k is: 

E[X k+1 -X k \e 1 ,...,e k ;±X k <210] = (^j + ^ [4B 2 - 4B k X k - R\ + 2R k X k ] + O^ 1 ). 

Proof. Let the connected components in the blue color class be C\, C2, Suppose that the (k + 1)- 

st edge has endpoints in Cj, Cj. If z = J, or if the edge is colored red, then the sum of the squares of 
the blue components does not change. Otherwise, it increases by exactly (|Cj| + |C,-|) 2 — \ Ci\ 2 — \ Cj\ 2 = 
2| Ci 1 1 1 . Therefore, 



E[X k+1 -X k \e 1 ,...,e k ;^X k <210] = ^2|C||C,' ! '' ' 1 



n n 



Pij 



where p^ is the probability that an with endpoints in Cj and Cj is colored blue. Note that is 
usually rj, but is sometimes or 1 when the endpoints hit isolated edges. The factor of n — 1 in the 
denominator is cumbersome, so we will replace it with an n. To do this, note that 2|Cj| 2 |Cj| 2 • 

Pij < 2 (Ej \Ci\ 2 ) 2 = 2X 2 < 2(210n) 2 = 0(n 2 ). Since - 4j = 0(n- 3 ), the total additive error 

we will make by replacing the n — 1 with an n is 0(n _1 ). Therefore, 

E [X k+1 - X k \ ei,...,e k ; \X k < 210] = \ ^ ■ Pij + Oin' 1 ). 



Let S be the right hand side of the final equality, and let S' be what it would be if all p^ were equal 
to \. Then 

S> = ^\Cm\ 2 + 0(n-') < (^) 2 + 0(n- 1 ). (5) 

Now we estimate the total error we made in S' by replacing all p^ with \. Whenever p^ = 0, 
we overestimated by ^j|Cj| 2 |Cj| 2 , and when p^ = 1, we underestimated by that same amount. To 
systematically examine all of the cases when p^ / i, we classify the components Cj of the blue color 
class into types, which we represent with the letters B, R, I, and J. We say that Cj has type B if it is 
part of the blue matching (hence a single edge), type R if it is part of the red matching (hence a single 
vertex), type I if it is an isolated vertex, and type J otherwise. Now we break into cases depending on 
the types of Cj and Cj. In each case, we calculate the sum of all |Cj| 2 |Cj| 2 of that type. 

Case BB. In this case, both Cj and Cj have type B, meaning that they are isolated edges from the 
blue matching. If the incoming edge has one endpoint in Cj and one endpoint in Cj, our algorithm 
will definitely color it red, so pij = 0. Any |C| 2 |Cj| 2 of this type is precisely 2 2 • 2 2 = 16. The 
number of Cj of type B is -^r, because the blue matching consists of ^ isolated blue edges. So, 

the number of pairs Cj, Cj of type BB with i 7^ j is ^ ■ — l) = — 0(n). Therefore, the 
sum of all |Cj| 2 |C,-| 2 of this type is 4B k — 0(n). 

Cases BI, IB. Again pij = 0. Any |Cj| 2 |Cj| 2 of this type is precisely 2 2 • l 2 = 4. There are ^ • I k 
pairs Cj, Cj of type BI, and the same number of type IB, so the sum is 4B k I k . 
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Cases BJ, JB. Again p^ = 0. Let Z be the set of indices j such that Cj has type J. Since there are 
-^n components Cj of type B, the sum of |Cj| 2 |Cj| 2 over all pairs of type BJ alone is 



f £2 2 -|C/ = 2B k ^\C,\ 2 

j&z j&z 

2\ 



— 2B k {X k - J 

= 2B k ^X k - I k - R k ^ ■ 2 2 

= 2B k (X k - I k - R k - 2B k ). 

The explanation is as follows. X k is the sum of all \Cj\ 2 . Then, we break the sum over j Z of 
\Cj\ 2 into the cases when Cj has type I, R, or B, in which \Cj\ is always 1, 1, and 2, respectively. 

The total contribution from pairs of type BJ and JB is twice that from BJ alone, so it is 
4B k (X k - I k — R k — 2B k ). 

Case RR. Now p^ = 1. Any |Cj| 2 |Cj| 2 of this type is precisely l 2 • l 2 = 1. The number of C 
of type R is R k , because the red matching consists of ^ isolated red edges, which give R k 
isolated vertices in the blue color class. So, the number of pairs Cj, Cj of type RR with i ^ j is 



R k ■ (R k - 1) = R\ - 0(n). Thus the sum of |Cj| 2 |Cd 2 is R\ - 0(n). 



Cases RI, IR. Again = 1. Any |Cj| 2 |C,-| 2 of this type is precisely l 2 • l 2 = 1. There are R k ■ I k 
pairs Cj, Cj of type RI, and the same number of type IR, so the sum is 2R k I k . 

Cases RJ, JR. Again = 1. Let Z be the set of indices j such that Cj has type J. Since there are 
R k components Cj of type R, the sum of |Cj| 2 |Cj| 2 over all pairs of type RJ is 

Rk/^l 2 • \Cj\ 2 = R k (X k - I k - R k - 2B k ), 
j&z 

where we used the exact same calculation as in the case BJ for ^Cjez I Cj 1 2 - We double this to 
include the contribution from JR, and obtain a total sum of 2R k (X k — I k — R k — 2B k ). 

All other cases. For all other pairs of types, our algorithm chooses a random color, so pij = ^, and 
there is no difference between S and S' . 

Combining all of the above calculations, we express E [A"&+i — X k \ e\, . . . ,e k ; \X k < 210] = S in 
terms of S' < {^-f + 0{ n - 1 ). 

S = S' [(4B 2 - 0(n)) + AB k I k + AB k (X k - I k - R k - 2B k )] 
+^ [(R 2 k - O(n)) + 2R k I k + 2R k (X k -I k -R k - 2B k )] . 

= S' + [AB 2 k - AB k X k -R\ + 2R k X k ] + 0{n' 1 ) 
< (^) +X i [ABl-AB k X k -R 2 k + 2R k X k ]+0(n- 1 ), 
as desired. □ 
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By Lemma l4.4l \B k and \Rk track b(t) = te -4 *, so Lemma [4 . 5 1 indeed indicates that the differential 
equation (j4|) estimates \X tn - We now prove this formally. Our method uses Hoeffding-Azuma, so we 
need bounded differences. In our proof of Theorem l2.6^ we achieved this by controlling the distribution 
of the component sizes with the result of Spencer and Wormald (Fact 12.51) . 

Recall that a graph has a K, c component tail if for all positive integers s, at most Ke -cs -fraction 
of vertices lie in components of order at least s. In particular, the empty graph has K, c component 
tail with K = e and c = 1. Fact 12.51 then ensures that after a period of random edge addition, the 
resulting graph still has a K',c' component tail. However, the period only lasts for about 0.5n edges 
when starting with the empty graph, and our process needs to run for 1.06n rounds. To work around 
this issue, we use several iterations. 

Define the sequence to, ... , tig, by letting to = 0, and tj+i =U+ j^pj, where x(t) is the solution of 
the differential equation The motivation for this sequence is as follows. Suppose we have already 
established that the blue graph after tjn rounds has a Ki,Ci component tail, and its susceptibility L is 
approximately x(tj), specifically, that L < 1.5x(tj). Then, we could apply Fact 12.51 with L = 1.5x(U), 
K = Ki, c = Ci, and 7 = \, to to conclude that after ij+in rounds, even if all new edges were colored 
blue, the blue graph would still have a Kj + i,Cj+i component tail whp. This allows us to define 
sequences Kq < • • • < K±g = K' and c\ > • • • > c±g = d . We confirmed numerically that tig > 1.06, 
so this would allow us to maintain a K',c' component tail for 1.06n rounds. Now we formalize this 
heuristic, and prove our two-color avoidance theorem. 

Proof of Theorem 11.31 For each < k < 1.06n, let be the event that all of the following hold: 



We only consider k < 1.06n, and x(t) < 209 for all t < 1.06, so if holds, we have < 210. Then 
Lemma 14.51 gives 



= < 




lX k < x{±)+e » n 4, 



and the blue graph has a K',c' component tail. 



We define a supermartingale. Let 




IE [Zk+i — Zk I ei, . . . ,ek,£k] 



< 




Now we estimate each term. Since — < x(-) + e « n 4 and k < 1.06n, we have (— ) 2 < x 2 (-) + 

n ^ — \n/ — ' \ n J — \nJ 

^ X in) e n 71,-3 +0(n _ 2). Similarly, — £ = & 2 (^) +0(n _ 3), and the same estimate holds for — Also, 
l(2R k - AB k ) = -26(|) + 0(n-s), so 
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From Taylor bounds similar to those in the proof of Lemma \4A\ we have xf^-^) — x(^) = h x '(^) + 

_ , n 500(fc+l) 500fc tr nf) 500fc _ ? nr! , i i • / 

(J(n ) and e ™ — e » = — e « + G>(n J. Combining all ol these bounds, and using x 
x 2 + 36 2 — 2bx, the entire estimate simplifies to 

E [-Zjfc+i — -^fc | ei, . . . , efc,£fc] < 

V" / V" 

which is indeed less than zero for large n because 6(t) = ie _4< is always less than 1, and x(t) < 209 
for all t < 1.06. Therefore Zo, . . . , Z\sx,n is a supermartingale. Note that Zq = — n*. Now because we 
are dealing with a graph with a K 1 , d tail, just as in the proof of Theorem 12.61 we have \Zk+i — Z\^\ = 
0(log 2 n) and then the Hoeffding-Azuma inequality implies that for each k < 1.06n, 



•2,. ( ^ + 26 ( - ] 500 

n I \n 



e n n 4 + c>(n 3 



' [Z fe > 0] < e 



-SXn 1 / 2 /^ 4 n) 



Therefore, by a union bound, whp < for all k < 1.06n. Also, Lemma 14.41 implies that whp, 
~ ^(n)l — e ^ n ~^ an< i l - ^ — Kn)l — e&n ~^ ^ or ever Y k < 1.06n. Let £ be the conjunction of all 
of these high-probability events. 

To complete our argument, we show by induction that whp, for each < i < 19, the blue graph 
after tin rounds has a Kj,Cj component tail. The base case i = is trivial. For the induction step, 
suppose that it is true for i. Condition on the blue graph after tin rounds having a Ki,Ci component 
tail, as well as on the event £ that all Z^ < and all B^, are concentrated. Then, the same argument 
as in the conclusion of the proof of Theorem 12.61 forces all £k to occur for k < tin, since Ki < K' 
and Cj > d . In particular, £t in already implies that after i,n rounds, the blue graph has susceptibility 
\X Un < x{U) + o(l) < l.bx(ti). Applying Fact [23] with L = l.5x(U), K = K { , c = a, and 7 = I 



we 



see that whp, even if all new edges were colored blue, the blue graph after Un+ (l — \) 2-1 = U+in 
rounds would have a ifj+j, Ci+i component tail. This finishes the induction, so whp the blue graph 
after tign > 1.06n rounds has a K',d component tail. In particular, all connected components are of 
order O(logn), so there is no giant in the blue color class. The same result follows for the red color 
class by symmetry. □ 



5 Online creation of giants 

Recall that the trivial bounds for the online creation of giants are as follows. No algorithm can create 
giants in all colors in fewer than (1 — e)^ total edges, because that is not even enough to make a 
giant in the uncolored graph. On the other hand, if one randomly colors each incoming edge, then 
monochromatic giants will appear after (r + e)^ total edges. In this section, we prove Theorems 11.41 
and 11.51 which improve the above trivial lower and upper bounds for the online creation of giants. 



5.1 Lower bound 

The previous argument iterated Fact 12.51 to maintain the component tail property, using a customized 
argument to control the susceptibility for a specific algorithm. In this section, we need to consider an 
arbitrary coloring strategy, so we use our general-purpose tool (Theorem 12. 6p to control the suscepti- 
bility. This will establish a lower bound of f2(nlogr) for the number of edges required to create giants 
online in each of r color classes. We need the following simple bound for random graphs. 
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Lemma 5.1. Let X be a constant. The random graph G UjP with p = ^ contains at most o(j^|-^) cycles 
of length at most \/log n, whp. 

k i \k 

Proof. The expected number of cycles of length k in G„ )P is at most r^p = ^j, so the expected 
number of cycles of length at most ylogn is below J2k=F^ ^ ^ — 1> this * s below yTogn. 
Otherwise, it is below yTogra • A^ log ™. In both cases, the conclusion follows from Markov's inequality. 

□ 

Next, we need a worst-case bound on how large the susceptibilities of different color classes can be 
when a graph is colored. 

Lemma 5.2. Let K,c be positive real constants. Let G be an n-vertex graph with a K,c component 
tail. Also assume that G contains o(j^^) cycles of length at most \f\ogn. Consider any 2-coloring 
of the edges of G, and let G^ and G^ be the n-vertex subgraphs of G obtained by keeping only edges 
in the first or second color, respectively. Then 5(G^) + S(G^) < 5(G) + 1 + o(l). 

Proof. Each component of is entirely contained within a component of G, so we may break 
down the left hand side by components of G. Consider first the components of G which are larger 
than -y/log n. Since G has a K, c component tail, the number of vertices in such components is at most 
Ke~ c ^ a%n n. The component tail also implies that there is some constant C such that all components 
of G are bounded by Clogn. Since 5(G«) + S(G^) = \ + cP), where eft is number of 

vertices in the component of G« containing v, the total contribution from vertices in components of 
G with order at least \T°g n is only ^ • Ke~ c ^ Q%n n ■ 2Clogra = o(l). 

Next, consider the components of order at most ^/^ogn which contain cycles. Since the suscepti- 
bility is — times the sum of squares of component sizes, each component of this type contributes at 
most i • 2(- v /log n) 2 to S(G^) + S(G^ 2 '). By assumption, G only has o(j^^) cycles small enough 
to fit into these components, so the number of such components is at most o(i^j)> Therefore, their 
total contribution to S{G^) + S(G^) is at most ± • 2(VTog^) 2 • o(^) = o(l). 

The main contribution comes from the remaining components, which are all trees. Any tree T 
in G contributes xtX^ftI-^I ^° We claim that it contributes at most xtX^ptO-^I *° 

5(G ( - 1 * ) ) + S(G^), i.e., the additional amount is at most ^\T\. Indeed, T's contribution to 5(G^) is 
precisely ^ times the sum of the sizes of the color-z components that contain each vertex v 6 T. Trees 
have the property that each pair of vertices is connected by a unique path, so we can express the size 
of the color- i component containing v as Y2weT ^v}wi where the indicator is 1 if the unique path 
between v and w is monochromatic in color i, and otherwise. Hence, the total contribution of T to 
5(G^) + S(G^) is ~ Ylv weT(Iv% + Iv%)- Since T is a tree, the only time both indicators Iv% can be 

1 is when w = v. So for each v, the sum YlweT^^w + Iv%) is at most \T\ + 1, as claimed. Summing 
over all tree components, we see that their total contribution to 5(G (1) ) + 5(G (2) ) exceeds 5(G) by 
at most ^ times the sum of the sizes of tree components, which is at most 1. Combining this with 
the contributions from non-tree components above, we obtain 5(G«) + S(G^) < 5(G) + 1 + o(l), 
as desired. □ 

Now we proceed to prove Theorem 1 1.41 using the previous two lemmas, and Theorem 12 .61 to control 
the evolution of susceptibility. We will show that for any r which is a power of two, whp no online 
algorithm can create giants in all r colors within (clog 2 r)n edges, where c ~ 0.043. This clearly 
implies the desired asymptotic bound. Our calculated bound for r = 2 will follow as a special case. 
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Proof of Theorem 11.41 Let Co be the set of all r = 2* colors. Let 7 be a constant parameter 
which we will specify later. The graph is initially empty, with susceptibility Lq = 1. By Theorem 12.61 
after (1 — 7)^Lq 1 edges, the graph formed by the union of those edges has K\,c\ component tail and 
susceptibility at most — a + o(l) whp. Arbitrarily divide the colors into two groups of size 2* _1 each. 
Lemmas 15.11 and 15.21 ensure that no matter how the edges were colored, one of the two color groups 
determines a graph G\ with susceptibility at most L\ + o(l), where L\ = ^(l + -S-)- Note that G\ 
still has Ki, ci-component tail, and let C\ be the set of 2* _1 colors we picked. 

We iterate this procedure a total of t times. For example, in the next step, we advance by 
(1 — 7)^Lj~ 1 more edges. Even if all of them received colors in C\ (i.e., were added to G\), the 
susceptibility of the graph determined by Ci-colors is at most — + o(l) whp, by Theorem 12.61 
Arbitrarily divide the colors of G\ into two groups of size 2 t ~ 2 each. Again by Lemmas 15.11 and 15.21 
one of the two color groups, say C 2 , determines a graph G2 with susceptibility at most L 2 + o(l), 
where L 2 = ±(l + ^). 

After t iterations, we conclude that there is some single color c such that the graph Gt determined 
by all edges of color c has a Kt,ct component tail and susceptibility at most Lt- A final application of 
Theorem 12.61 implies that we can add fC^" 1 -e) more random edges and still have all components in 
color c of order O(logn) whp. 

It remains to count the total number of edges which we have accumulated. The relationship 
between the Lj's is L i+1 = ~(l + -rr) = \ + ^z, so 



L = 1, 

1 1 
L\ = — I , 

2 2 7 ' 

r 111 

2 47 47 2 
11 11 

3 ~ 2 + 4t" + 87^ + 87^ 



and in general, 



111 



2 2(2 7 ) 2(2 7 ) 2 2(27)*-! (27)* 

1 1 
< ! + — + ••• + 



2 7 (2 7 )* 
1 



Thus, the total number of edges added (not even counting the final step) is at least 

t-i 



(1-7)2 E^r 1 > (1 -^2- i ( 1 -2^J 

i=0 \ / / 



By routine calculus, the optimal choice for 7 is giving (1 — 7)(l — ^) = § — v2 0.086. 
Since t = log 2 r, we indeed see that whp, no online algorithm can create giants in all colors within 
0.043nlog 2 r edges. This completes the proof of the asymptotic bound. 
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For the specific case of r = 2 colors, we can add the final batch of ^{L^ 1 — e) random edges (here 
t = 1) to get a specific bound which beats the trivial bound of n/2 edges. Since L\ = ^(l + -), this 
gives a total edge count of 



(1 



^2 + 2^ 



n 
2 

n 
2 



1 



(l- 7 )+(-(l + - 



(1-7) + 



2 

27 



7 + 1 



By routine calculus, the optimal choice for 7 is \p2 — 1. Therefore, whp, no online algorithm can 
create giants in both colors within (2 — — e)n edges, as claimed. □ 



5.2 Upper bound for many colors 

In this section, we present an online coloring algorithm which creates giants in all r color classes within 
roughly \yfr edges. The strategy is based on the classical fact that there are infinitely many values 
of r such that the edges of K r can be perfectly partitioned into cliques of order roughly ^fr. 

Fact 5.3. Let r = q 2 + g + 1 for some prime power q. The edges of K r can be partitioned into disjoint 
sets Ei, . . . ,E r such that each Ei is precisely the edge set of some clique of order q + I. 

Proof. The projective plane of order r = q 2 + q + 1 is the finite geometry where points and lines 
correspond to dimension-1 and dimension-2 subspaces of F^, respectively. This object contains exactly 

q q Z\ = Q 2 + Q + 1 points and the same number of lines, and has the property that every pair of distinct 
points determines a unique line. 

Identify the vertices of K r with the points of the projective plane. Let the q + 1 vertices of the 
clique corresponding to E^ be the points contained in the z-th line of the projective plane. The edge 
partition property is then equivalent to the incidence property of the projective plane. □ 

We also need the giant component threshold in certain inhomogeneous random graph models, 
where the edge probability is not uniformly p at all (2) possible sites. Instead, the probability of each 
edge depends on the locations of its endpoints. Bollobas, Janson, and Riordan recently completed a 
far-reaching study of phase transitions in these types of inhomogeneous models in F7j. We use a special 
case of their work, regarding the specific model below. 

Fix a symmetric kxk matrix A = iflij). Let G Ut A be the n- vertex random graph defined as follows. 
Split the n vertices into k groups of size n/k. Between each pair of distinct vertices, say from the i-th 
and j-th groups (where i may equal j), place an independent random edge with probability — . Note 
that when A = cJ^, where is the kxk all-ones matrix, G nt A is the Erdos-Renyi random graph G n ^ p 
with p = f . 

The following result was proved as Theorem 3.1 of [J]. Here, the L2 operator norm ||-B||2 of a k X k 
matrix B is sup{||-Bx||2 : ||x||2 = 1}, and the 2-norm of a vector (x\, . . . ,Xk) is \JY1 x 1- 

Fact 5.4. Let A = (a^) be a symmetric kxk matrix, and let A be its normalization inf)- If\\A\\2 > 1, 
then G n ^A contains a giant component whp. 

Remark 1. In the same theorem, Bollobas, Janson, and Riordan also proved the complementary 
result that when ||^4||2 < 1, the largest component of G n ^A is o(n) whp. However, we do not need this 
part for our analysis. 
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Remark 2. The L2 operator norm of a real symmetric matrix A always equals its spectral radius 
p(A), which is the maximum |Aj| over all eigenvalues Aj. Indeed, A is diagonalizable with an orthonor- 
mal basis of real eigenvectors, so let the eigenvalues and eigenvectors be Ai,...,A^ and v\, . . . ,Vk, 
respectively. Expressing any vector x in this basis as ^Cjfj, we have that the condition ||x||2 = 1 

is precisely ^ c? = 1 , and ||Ae|| 2 = \JY1 ^h ]- Therefore, ||Ae||2 has maximum value equal to the 
largest absolute value of an eigenvalue. 

Remark 3. The Erdos-Renyi model G„ )P with p = — corresponds to G n ^A with A = cJt- The 
normalized matrix A = %Jk nas eigenvalues c and 0, so Fact 15.41 implies the classical result that the 
giant component appears after p =\. 

(k) 

We use this to study the A;-partite random graph Gn,p, which has n vertices split into equal groups 
of size j, and independent random edges with probability p = ^ between pairs of vertices from distinct 
groups. In the above framework, this is G n ^A with A = c(Jk — Ik)- 

Corollary 5.5. Let k > 2 be a positive integer, and let c > -£r be a real number. Then the k-partite 

Ik) 

random graph Gn lP with p = ^ contains a giant component whp. 

Proof. By Fact 15.41 and our second remark, the problem reduces to determining the eigenvalues of 
A = I (Jfc — Ik). These are precisely %(k — 1) and #(0 — 1), so since k > 2, the giant component 
appears once c > -^j. □ 

We are now ready to state our algorithm and prove its effectiveness. Note that a coloring algorithm 
that produces giants in r colors trivially gives coloring algorithms for any r' < r as well, simply by 
using the first color whenever any color beyond r' was to be used. So, Theorem 11.51 is a consequence 
of the following more precise formulation, combined with the Prime Number Theorem and Fact 12.31 

Theorem. Let r = q 2 + q + 1 for some prime power q. There is an online algorithm such that for any 
e > 0, whp all r color classes contain giant components within (- + e)§ edges. 

Proof. Arbitrarily partition the n vertices into r sets Vi, . . . ,V r , each of size -. By Fact 15.31 there is 
a partition E\ U . . . U E r of the edges of K r , such that each E t is precisely the edge set of some clique 
of order q + 1. Our online coloring algorithm is then as follows. Usually, the incoming edge will have 
endpoints in distinct parts Vi and Vj. In that case, color the edge with the index t of the E t which 
contains the edge ij in the partitioned graph K r . Otherwise, if the incoming edge is spanned by a 
single Vi, then discard the edge entirely. Note that this is even stronger than coloring it, because we 
will now find giants without using those edges at all. 

Our algorithm disregards the entire history of the process, since the color of each edge is a function 
of the locations of its endpoints. In particular, the order of the edges is irrelevant, so the performance 
only depends on the final edge set. Thus, by Fact 12.31 it suffices to show that if this strategy is applied 
to G n ,p with p = (5 + 5)^) then it creates giants in all colors whp. By passing to this independent 

model, each color class itself becomes a (g+l)-partite random graph G^, + ^\ on only n' = ^(q+1) 
vertices. Indeed, E t is the edge set of a clique on some set S of q + 1 vertices of K r , so the edges that 
receive color t are precisely those with endpoints in some Vi and Vj with i 7^ j and i,j £ S. 

Finally, we can apply Corollary 15.51 with k = q+ 1, since p = ^7 with d = f - + s) r ■ > 
Therefore, each individual color class contains a giant component whp. Taking a union bound over 
all r (finitely many) color classes finishes the proof. □ 
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5.3 Upper bound for 2 colors 

To adapt our strategy from the previous section to the case r = 2, we must specify symmetric 0-1 
matrices A\ and A2 which sum to the k x k all-ones matrix We then split the vertices into k equal 
parts Vi, . . . ,Vk, and color an edge with endpoints in some Vi, Vj with color 1 if the ij-entry of A\ is 
1, and color 2 otherwise. 

Then, after applying this strategy to the edges of G„ )P with p = £ the i-th color class is a copy 
of G n c j^. By the second remark after Fact 15.41 this contains a giant component when the spectral 
radius p(%Ai) exceeds 1. Since our objective is to create giants in both colors as rapidly as possible, 
we want to select A\ and A2 such that A\ + A2 = Jk, but mm{p(Ai), p(A2)} is as large as possible. 
This appears to be a nontrivial problem, but one simple way to choose the matrices is to let A\ have 
l's in the top-left t X t submatrix, and 0's everywhere else. This leads to the following bound. 

Proposition 5.6. For every e > 0, it is possible to create giants in two colors online within (| + e)n 
rounds whp. 

Proof sketch. Since Ai is just Jt embedded in an all-zeros matrix, its spectral radius is precisely 
t. Next, note that A2 = Jk — A\ has rank 2, so it has at most 2 nonzero eigenvalues Ai,A2- The 
trace of A2 is k — t, so Ai + A2 = k — t. Also, the main diagonal of A\ has its first t entries equal to 
k — t, and the remaining k — t entries equal to k, giving tr(^4j) = t{k — t) + (k — t)k = k 2 — t 2 . This 
trace also equals X 2 + A|, because the nonzero eigenvalues of A\ are X 2 and \\. Solving this system of 
equations, one finds that the largest eigenvalue of A2 is \(k — t + \/k 2 + 2kt — 3t 2 ). Recall that the 
largest eigenvalue of A% is t, and we wanted the largest possible min{p(Ai), p(A2)}. Routine calculus 
shows that the optimal choice of t is giving both p(A{) = So, we choose the particular 3x3 
matrices 

/ 1 1 \ / 1 \ 

Ax = 1 1 I , A 2 = 1 I . 

\ / \ 1 1 1 / 

Therefore, as we remarked at the beginning, Fact 15.41 shows that when this strategy is applied to G ntP 
with p=£j both colors will contain giant components if their spectral radii p(%Ai) exceed 1, i.e., once 
c > |. By Fact 12.31 this happens after (| + rounds, so we are done. □ 

Remark. Although the partition we chose may appear naive, there is evidence to suggest that it 
may be optimal. Note that if we ignore the main diagonal (an effect that can be made negligible by 
choosing large k) and seek A\ + A2 = Jk — Ik, then A\ and A2 are the adjacency matrices of a graph 
and its complement. 

Several researchers have studied the question of bounding the sum of the spectral radii of the 
adjacency matrices of complementary graphs (see [HI Q31 [191 12Q1 [211 EZ])- In particular, Nikiforov 
recently conjectured in [19] that the sum of these two spectral radii is always at most |fc + 0(l), where 
k is the number of vertices. If true, this would imply that mm{p(Ai), p^A 2 )} < |fc + 0(1), which our 
construction achieved. In fact, in his extremal example, one graph was a clique on a subset of the 
vertices, which is essentially the same as our construction. So, perhaps |n is the limit of what can be 
achieved by any strategy as above. 

Next, we prove Theorem 11.61 which shows that by making the strategy more adaptive, one can 
create giants even faster. The algorithm in the proof of Proposition 15.61 fixed a subset R of vertices 
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in advance, and used the first color whenever an edge was spanned by R. The key idea is to let the 
subset R depend on the outcomes of the first few rounds. To analyze this strategy, we will need two 
results of Spencer and Wormald. These are Theorems 1.1 and 3.1 of [23], restated in equivalent form 
using Fact E3J 

Fact 5.7. Let < t < | be a fixed parameter. Then there exist constants K,c such that whp, the 
graph on n vertices formed by tn independent random edges has susceptibility jzm + o(l), and a K,c 
component tail. 

Fact 5.8. Let L, K, c, e be positive real numbers. Let G be a graph on n vertices with a K,c component 
tail and S(G) = L. Then, after adding (l + e)& more independent random edges, the resulting graph 
contains a giant component whp. 



Proof of Theorem 11.61 Let the colors be red and blue. We state the coloring strategy in terms of a 
constant parameter t, which we can optimize at the end. (The best choice turns out to be t ~ 0.189.) 
For the first tn rounds, color all edges red. Then, permanently fix R to be the set of all vertices 
incident to a red edge at that time. Color each future edge red whenever both endpoints lie in R, and 
blue otherwise. 

Let a = The argument at the beginning of the proof of Lemma [4.41 shows that a = (1 — e~ 2t + 
o(l)) whp. Let us analyze how many rounds are required for a red giant to appear. By Fact 15.71 the 
(completely red) graph G at time tn has susceptibility S(G) = j^m + whp, so the sum of the 
squares of its component sizes is (j~2i + °(l)) re - Let Gr be the subgraph of G induced by R. The 
sum of the squares of the components in Gr is precisely S(G)n — (1 — a)n, because all components of 
G outside R are singletons. Therefore, since Gr has an vertices, its susceptibility L is: 

L = —[S(G)n-(l-a)n] = (l + o(l))~ 
an a 

Then, by Fact 15.81 whp the red graph will contain a giant component after (1 + e)^|; more random 
edges are added with both endpoints in R. By a standard coupling as in Fact 12.31 this happens 
after (1 + e)^ • a~ 2 more rounds whp, since each incoming edge falls within R with probability a 2 . 
Substituting \R\ = an, we find that a red giant appears after a grand total of tn + (^aL e ) n = 
[t + | [jzm ~ e~ 2t ) 1 + e]n rounds whp. 

To analyze the blue graph, observe that by a similar coupling to Fact 12.31 after in + (l + e)^ rounds 
the blue graph contains G n ,cA whp, where A is the n x n matrix with O's in the top-left \R\ x \R\ 
submatrix, and l's everywhere else. Plugging \R\ = an into the eigenvalue calculation from the proof 
of Proposition [5761 we see that the largest eigenvalue of A is § (l — o + y/l + 2a — 3a 2 ) . Thus, Fact 15.41 
implies that whp, the giant component appears in the blue graph once c surpasses - a+ /^ 2a 3q , 2 + e i 
i.e., when the total number of rounds exceeds tn -\ } +t + n. 

Since a = 1 — e~ 2t + o(l), it is now routine to numerically optimize t. It turns out that the best 
choice is t ~ 0.189, which gives a ~ 0.314. Then, both of the bounds at the ends of the previous two 
paragraphs are satisfied after 0.733n rounds, completing the proof. □ 
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6 Concluding remarks 



In this paper we have introduced several rather natural algorithmic variants of the classical problem 
of the appearance of the giant component in a random graph/process. As expected, the offline cases 
of these problems appear to be much more accessible, and indeed we managed to solve both the 
avoidance and the embracing versions asymptotically for any fixed r. The online case seems to be 
more challenging; there we showed that in all cases one can do better than the trivial algorithms that 
randomly color each incoming edge, but for creating giants, rather sizable gaps remain. 

It would certainly be nice to settle the case of two colors for creating and avoiding giants online 
in both color classes, but that could be difficult. A more approachable problem might be to close the 
asymptotic gap between the lower bound of fi(logr) • n and the upper bound of 0(y/r) • n for the 
question of creating giants in r colors. In particular, can one show a lower bound of the form r a n for 
some positive constant a? 

Another, perhaps more technical, issue that we would like to see settled is the nature of an algorithm 
for avoiding giants online. Our online avoidance algorithm is randomized. Is there a deterministic 
strategy that matches its performance in the online setting? 
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